Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 116147 |
| Missing cells | 68136 |
| Missing cells (%) | 3.3% |
| Duplicate rows | 8369 |
| Duplicate rows (%) | 7.2% |
| Total size in memory | 61.6 MiB |
| Average record size in memory | 556.3 B |
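
The percentages in the table above follow directly from the counts. The sketch below shows one way to recompute them with pandas. The helper name `overview` and the toy frame are hypothetical, and whether pandas-profiling counts duplicate rows exactly as `DataFrame.duplicated()` does is an assumption, not something this report confirms.

```python
import pandas as pd


def overview(df: pd.DataFrame) -> dict:
    """Recompute headline overview figures (illustrative helper, not a library API)."""
    n_rows, n_cols = df.shape
    missing_cells = int(df.isna().sum().sum())
    # duplicated() flags each repeat of an earlier row; the first copy is not counted.
    duplicate_rows = int(df.duplicated().sum())
    return {
        "variables": n_cols,
        "observations": n_rows,
        "missing_cells": missing_cells,
        "missing_cells_pct": 100 * missing_cells / (n_rows * n_cols),
        "duplicate_rows": duplicate_rows,
        "duplicate_rows_pct": 100 * duplicate_rows / n_rows,
    }


# Toy data with hypothetical values: 4 rows x 2 columns, one missing cell, one repeated row.
toy = pd.DataFrame({"a": [1, 1, 2, None], "b": ["x", "x", "y", "z"]})
stats = overview(toy)

# Arithmetic check using the counts reported in the table above.
reported_missing_pct = 100 * 68136 / (116147 * 18)
reported_duplicate_pct = 100 * 8369 / 116147
```

Rounded to one decimal place, the two `reported_*` values agree with the 3.3% and 7.2% shown in the table.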
Variable types
| Categorical | 7 |
|---|---|
| Numeric | 11 |
Warnings
| Warning | Type |
|---|---|
| Dataset has 8369 (7.2%) duplicate rows | Duplicates |
| nearest_mrt has a high cardinality: 137 distinct values | High cardinality |
| nearest_mall has a high cardinality: 137 distinct values | High cardinality |
| Price ($) is highly correlated with Area (Sqft) | High correlation |
| Area (Sqft) is highly correlated with Price ($) | High correlation |
| tenure_yrs_clean is highly correlated with lease_commencement and 1 other field | High correlation |
| lease_commencement is highly correlated with tenure_yrs_clean | High correlation |
| remaining_lease is highly correlated with tenure_yrs_clean | High correlation |
| Type is highly correlated with Type of Area | High correlation |
| Floor Level is highly correlated with Type of Area | High correlation |
| Type of Area is highly correlated with Type and 1 other field | High correlation |
| lease_commencement has 32926 (28.3%) missing values | Missing |
| remaining_lease has 32926 (28.3%) missing values | Missing |
| Price ($) is highly skewed (γ1 = 70.42264601) | Skewed |
| Area (Sqft) is highly skewed (γ1 = 85.0509881) | Skewed |
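
High-correlation warnings such as the Price ($) and Area (Sqft) pair can be approximated by scanning a correlation matrix for large absolute values. The sketch below is a minimal illustration, not the report's own method: the Pearson coefficient, the 0.9 threshold, the helper name `high_correlation_pairs`, and the synthetic columns are all assumptions.

```python
import numpy as np
import pandas as pd


def high_correlation_pairs(df: pd.DataFrame, threshold: float = 0.9) -> list:
    """Return (col_a, col_b, r) for numeric column pairs with |r| >= threshold."""
    corr = df.select_dtypes(include="number").corr(method="pearson")
    cols = list(corr.columns)
    pairs = []
    for i, a in enumerate(cols):
        for b in cols[i + 1:]:  # upper triangle only, so each pair appears once
            r = corr.loc[a, b]
            if abs(r) >= threshold:
                pairs.append((a, b, float(r)))
    return pairs


# Synthetic data (hypothetical): price is nearly linear in area, noise is unrelated.
rng = np.random.default_rng(0)
area = rng.uniform(400, 3000, size=500)
toy = pd.DataFrame({
    "area": area,
    "price": 1400 * area + rng.normal(0, 50_000, size=500),
    "noise": rng.normal(size=500),
})
pairs = high_correlation_pairs(toy)
```

Pearson correlation is sensitive to extreme values, so with columns as skewed as those flagged above, a rank-based coefficient (`method="spearman"`) may be worth comparing.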
Reproduction
| Analysis started | 2021-04-01 16:43:49.130533 |
|---|---|
| Analysis finished | 2021-04-01 16:44:35.903815 |
| Duration | 46.77 seconds |
| Software version | pandas-profiling v2.11.0 |
Type
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| Condominium | 52828 |
|---|---|
| Apartment | 39804 |
| Executive Condominium | 13666 |
| Terrace | 4808 |
| Semi-detached | 2426 |
| Other values (4) | 2615 |
Length
| Max length | 21 |
|---|---|
| Median length | 11 |
| Mean length | 11.3940868 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1323389 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Executive Condominium |
|---|---|
| 2nd row | Condominium |
| 3rd row | Executive Condominium |
| 4th row | Condominium |
| 5th row | Condominium |
| Value | Count | Frequency (%) |
| Condominium | 52828 | 45.5% |
| Apartment | 39804 | 34.3% |
| Executive Condominium | 13666 | 11.8% |
| Terrace | 4808 | 4.1% |
| Semi-detached | 2426 | 2.1% |
| Strata Terrace | 1225 | 1.1% |
| Detached | 1055 | 0.9% |
| Strata Semi-detached | 250 | 0.2% |
| Strata Detached | 85 | 0.1% |
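
The Frequency (%) column is each count divided by the total number of observations. As a small check, the sketch below recomputes it from the counts in the table above; the variable names are illustrative only.

```python
import pandas as pd

# Counts copied from the value table above.
counts = pd.Series({
    "Condominium": 52828,
    "Apartment": 39804,
    "Executive Condominium": 13666,
    "Terrace": 4808,
    "Semi-detached": 2426,
    "Strata Terrace": 1225,
    "Detached": 1055,
    "Strata Semi-detached": 250,
    "Strata Detached": 85,
})

total = int(counts.sum())  # equals the number of observations when nothing is missing
frequency_pct = (100 * counts / total).round(1)
```

On the raw data, `Series.value_counts(normalize=True)` would give the same proportions without building the counts by hand.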
| Value | Count | Frequency (%) |
| condominium | 66494 | 50.6% |
| apartment | 39804 | 30.3% |
| executive | 13666 | 10.4% |
| terrace | 6033 | 4.6% |
| semi-detached | 2676 | 2.0% |
| strata | 1560 | 1.2% |
| detached | 1140 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 175468 | |
| n | 172792 | |
| i | 149330 | |
| o | 132988 | |
| t | 100210 | |
| e | 89510 | 6.8% |
| u | 80160 | 6.1% |
| d | 72986 | 5.5% |
| C | 66494 | 5.0% |
| r | 53430 | 4.0% |
| Other values (13) | 230021 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1174114 | |
| Uppercase Letter | 131373 | 9.9% |
| Space Separator | 15226 | 1.2% |
| Dash Punctuation | 2676 | 0.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| m | 175468 | |
| n | 172792 | |
| i | 149330 | |
| o | 132988 | |
| t | 100210 | |
| e | 89510 | |
| u | 80160 | |
| d | 72986 | |
| r | 53430 | 4.6% |
| a | 52773 | 4.5% |
| Other values (5) | 94467 |
| Value | Count | Frequency (%) |
| C | 66494 | |
| A | 39804 | |
| E | 13666 | 10.4% |
| T | 6033 | 4.6% |
| S | 4236 | 3.2% |
| D | 1140 | 0.9% |
| Value | Count | Frequency (%) |
| 15226 |
| Value | Count | Frequency (%) |
| - | 2676 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1305487 | |
| Common | 17902 | 1.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| m | 175468 | |
| n | 172792 | |
| i | 149330 | |
| o | 132988 | |
| t | 100210 | |
| e | 89510 | |
| u | 80160 | 6.1% |
| d | 72986 | 5.6% |
| C | 66494 | 5.1% |
| r | 53430 | 4.1% |
| Other values (11) | 212119 |
| Value | Count | Frequency (%) |
| 15226 | ||
| - | 2676 | 14.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1323389 |
Most frequent character per block
| Value | Count | Frequency (%) |
| m | 175468 | |
| n | 172792 | |
| i | 149330 | |
| o | 132988 | |
| t | 100210 | |
| e | 89510 | 6.8% |
| u | 80160 | 6.1% |
| d | 72986 | 5.5% |
| C | 66494 | 5.0% |
| r | 53430 | 4.0% |
| Other values (13) | 230021 |
Postal District
Real number (ℝ≥0)
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.28724806 |
|---|---|
| Minimum | 1 |
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 10 |
| median | 16 |
| Q3 | 19 |
| 95-th percentile | 27 |
| Maximum | 28 |
| Range | 27 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 7.004070996 |
|---|---|
| Coefficient of variation (CV) | 0.4581642798 |
| Kurtosis | -0.7754186386 |
| Mean | 15.28724806 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.2162551496 |
| Sum | 1775568 |
| Variance | 49.05701051 |
| Monotonicity | Not monotonic |
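
Several of the descriptive statistics above are simple functions of others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the MAD is the median of absolute deviations from the median. The sketch below illustrates these definitions on toy values. The helper name `describe` is hypothetical, and the use of the sample standard deviation (`ddof=1`) is an assumption.

```python
import numpy as np


def describe(values) -> dict:
    """Derive range, IQR, CV and MAD from raw values (illustrative helper)."""
    x = np.asarray(values, dtype=float)
    q1, median, q3 = np.percentile(x, [25, 50, 75])
    std = x.std(ddof=1)  # sample standard deviation (assumed convention)
    return {
        "range": x.max() - x.min(),
        "iqr": q3 - q1,
        "cv": std / x.mean(),
        "mad": np.median(np.abs(x - median)),
    }


# Toy values, not taken from the profiled dataset.
stats = describe([1, 2, 3, 4, 10])

# Consistency check on the Postal District figures reported above.
reported_range = 28 - 1                    # Maximum - Minimum
reported_iqr = 19 - 10                     # Q3 - Q1
reported_cv = 7.004070996 / 15.28724806    # Standard deviation / Mean
```

The three `reported_*` values reproduce the Range (27), IQR (9) and CV (about 0.458) listed in the tables.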
| Value | Count | Frequency (%) |
| 19 | 17723 | 15.3% |
| 5 | 8125 | 7.0% |
| 15 | 7759 | 6.7% |
| 18 | 7680 | 6.6% |
| 23 | 7022 | 6.0% |
| 3 | 6753 | 5.8% |
| 14 | 6383 | 5.5% |
| 10 | 6182 | 5.3% |
| 27 | 5745 | 4.9% |
| 9 | 5321 | 4.6% |
| Other values (17) | 37454 | 32.2% |
| Value | Count | Frequency (%) |
| 1 | 1077 | 0.9% |
| 2 | 1010 | 0.9% |
| 3 | 6753 | 5.8% |
| 4 | 1595 | 1.4% |
| 5 | 8125 | 7.0% |
| 6 | 5 | < 0.1% |
| 7 | 902 | 0.8% |
| 8 | 1281 | 1.1% |
| 9 | 5321 | 4.6% |
| 10 | 6182 | 5.3% |
| Value | Count | Frequency (%) |
| 28 | 2960 | 2.5% |
| 27 | 5745 | 4.9% |
| 26 | 922 | 0.8% |
| 25 | 1604 | 1.4% |
| 23 | 7022 | 6.0% |
| 22 | 2591 | 2.2% |
| 21 | 4214 | 3.6% |
| 20 | 3853 | 3.3% |
| 19 | 17723 | 15.3% |
| 18 | 7680 | 6.6% |
Market Segment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.6 MiB |
| OCR | 64755 |
|---|---|
| RCR | 34582 |
| CCR | 16810 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 348441 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OCR |
|---|---|
| 2nd row | OCR |
| 3rd row | OCR |
| 4th row | OCR |
| 5th row | OCR |
| Value | Count | Frequency (%) |
| OCR | 64755 | 55.8% |
| RCR | 34582 | 29.8% |
| CCR | 16810 | 14.5% |
| Value | Count | Frequency (%) |
| ocr | 64755 | 55.8% |
| rcr | 34582 | 29.8% |
| ccr | 16810 | 14.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 150729 | |
| C | 132957 | |
| O | 64755 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 348441 |
Most frequent character per category
| Value | Count | Frequency (%) |
| R | 150729 | |
| C | 132957 | |
| O | 64755 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 348441 |
Most frequent character per script
| Value | Count | Frequency (%) |
| R | 150729 | |
| C | 132957 | |
| O | 64755 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 348441 |
Most frequent character per block
| Value | Count | Frequency (%) |
| R | 150729 | |
| C | 132957 | |
| O | 64755 |
Type of Sale
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.1 MiB |
| Resale | 58397 |
|---|---|
| New Sale | 56162 |
| Sub Sale | 1588 |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.994429473 |
| Min length | 6 |
Characters and Unicode
| Total characters | 812382 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Resale |
|---|---|
| 2nd row | Resale |
| 3rd row | Resale |
| 4th row | Resale |
| 5th row | Resale |
| Value | Count | Frequency (%) |
| Resale | 58397 | 50.3% |
| New Sale | 56162 | 48.4% |
| Sub Sale | 1588 | 1.4% |
| Value | Count | Frequency (%) |
| resale | 58397 | 33.6% |
| sale | 57750 | 33.2% |
| new | 56162 | 32.3% |
| sub | 1588 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 230706 | |
| a | 116147 | |
| l | 116147 | |
| S | 59338 | 7.3% |
| R | 58397 | 7.2% |
| s | 58397 | 7.2% |
| 57750 | 7.1% | |
| N | 56162 | 6.9% |
| w | 56162 | 6.9% |
| u | 1588 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 580735 | |
| Uppercase Letter | 173897 | 21.4% |
| Space Separator | 57750 | 7.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 230706 | |
| a | 116147 | |
| l | 116147 | |
| s | 58397 | 10.1% |
| w | 56162 | 9.7% |
| u | 1588 | 0.3% |
| b | 1588 | 0.3% |
| Value | Count | Frequency (%) |
| S | 59338 | |
| R | 58397 | |
| N | 56162 |
| Value | Count | Frequency (%) |
| 57750 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 754632 | |
| Common | 57750 | 7.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 230706 | |
| a | 116147 | |
| l | 116147 | |
| S | 59338 | 7.9% |
| R | 58397 | 7.7% |
| s | 58397 | 7.7% |
| N | 56162 | 7.4% |
| w | 56162 | 7.4% |
| u | 1588 | 0.2% |
| b | 1588 | 0.2% |
| Value | Count | Frequency (%) |
| 57750 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 812382 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 230706 | |
| a | 116147 | |
| l | 116147 | |
| S | 59338 | 7.3% |
| R | 58397 | 7.2% |
| s | 58397 | 7.2% |
| 57750 | 7.1% | |
| N | 56162 | 6.9% |
| w | 56162 | 6.9% |
| u | 1588 | 0.2% |
Price ($)
Real number (ℝ≥0)
| Distinct | 19922 |
|---|---|
| Distinct (%) | 17.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1839632.184 |
|---|---|
| Minimum | 40000 |
| Maximum | 980000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 40000 |
|---|---|
| 5-th percentile | 686000 |
| Q1 | 932000 |
| median | 1254000 |
| Q3 | 1772985 |
| 95-th percentile | 3850000 |
| Maximum | 980000000 |
| Range | 979960000 |
| Interquartile range (IQR) | 840985 |
Descriptive statistics
| Standard deviation | 9365558.147 |
|---|---|
| Coefficient of variation (CV) | 5.090994943 |
| Kurtosis | 5853.210042 |
| Mean | 1839632.184 |
| Median Absolute Deviation (MAD) | 376200 |
| Skewness | 70.42264601 |
| Sum | 2.136677593 × 10¹¹ |
| Variance | 8.771367941 × 10¹³ |
| Monotonicity | Not monotonic |
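
With a skewness of about 70 and a maximum of 980,000,000 against a median of 1,254,000, the mean and standard deviation above are dominated by a handful of very large transactions. One common way to tame such a distribution before modelling is a log transform. The sketch below demonstrates the effect on synthetic log-normal data; the data, seed, and parameters are assumptions for illustration and say nothing about how the profiled dataset was or should be processed.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)

# Synthetic right-skewed "prices" drawn from a log-normal distribution (NOT the profiled data).
prices = pd.Series(rng.lognormal(mean=14.0, sigma=1.0, size=10_000))

raw_skew = prices.skew()          # strongly positive for log-normal data
log_skew = np.log(prices).skew()  # close to zero, since the log of log-normal data is normal

# Median and MAD are much less affected by the long right tail than mean and std.
median = prices.median()
mad = (prices - median).abs().median()
```

For real data with extreme values, it is usually worth inspecting the largest records before transforming or dropping them, since they may be legitimate transactions of a different kind.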
| Value | Count | Frequency (%) |
| 1200000 | 708 | 0.6% |
| 1100000 | 660 | 0.6% |
| 1300000 | 605 | 0.5% |
| 1050000 | 595 | 0.5% |
| 1500000 | 580 | 0.5% |
| 1150000 | 522 | 0.4% |
| 1250000 | 499 | 0.4% |
| 1400000 | 497 | 0.4% |
| 1600000 | 491 | 0.4% |
| 1180000 | 475 | 0.4% |
| Other values (19912) | 110515 | 95.2% |
| Value | Count | Frequency (%) |
| 40000 | 1 | < 0.1% |
| 50000 | 2 | < 0.1% |
| 63000 | 1 | < 0.1% |
| 288000 | 1 | < 0.1% |
| 300000 | 1 | < 0.1% |
| 330000 | 2 | < 0.1% |
| 356000 | 1 | < 0.1% |
| 358000 | 1 | < 0.1% |
| 360000 | 1 | < 0.1% |
| 362000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 980000000 | 1 | < 0.1% |
| 970000000 | 1 | < 0.1% |
| 906889000 | 1 | < 0.1% |
| 906700000 | 1 | < 0.1% |
| 840888888 | 1 | < 0.1% |
| 765781819 | 1 | < 0.1% |
| 728000000 | 1 | < 0.1% |
| 638000000 | 1 | < 0.1% |
| 629000000 | 1 | < 0.1% |
| 610000000 | 1 | < 0.1% |
Area (Sqft)
Real number (ℝ≥0)
| Distinct | 3761 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1349.925302 |
|---|---|
| Minimum | 258 |
| Maximum | 947081 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 258 |
|---|---|
| 5-th percentile | 474 |
| Q1 | 721 |
| median | 1033 |
| Q3 | 1346 |
| 95-th percentile | 2888 |
| Maximum | 947081 |
| Range | 946823 |
| Interquartile range (IQR) | 625 |
Descriptive statistics
| Standard deviation | 6217.933946 |
|---|---|
| Coefficient of variation (CV) | 4.606131864 |
| Kurtosis | 9000.076898 |
| Mean | 1349.925302 |
| Median Absolute Deviation (MAD) | 312 |
| Skewness | 85.0509881 |
| Sum | 156789774 |
| Variance | 38662702.55 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 958 | 1610 | 1.4% |
| 463 | 1602 | 1.4% |
| 915 | 1526 | 1.3% |
| 764 | 1519 | 1.3% |
| 1055 | 1473 | 1.3% |
| 678 | 1419 | 1.2% |
| 1109 | 1416 | 1.2% |
| 700 | 1410 | 1.2% |
| 484 | 1394 | 1.2% |
| 904 | 1349 | 1.2% |
| Other values (3751) | 101429 | 87.3% |
| Value | Count | Frequency (%) |
| 258 | 1 | < 0.1% |
| 323 | 13 | < 0.1% |
| 334 | 28 | < 0.1% |
| 344 | 39 | < 0.1% |
| 355 | 57 | < 0.1% |
| 366 | 111 | 0.1% |
| 377 | 70 | 0.1% |
| 388 | 117 | 0.1% |
| 398 | 231 | 0.2% |
| 409 | 256 | 0.2% |
| Value | Count | Frequency (%) |
| 947081 | 1 | < 0.1% |
| 629263 | 1 | < 0.1% |
| 601040 | 1 | < 0.1% |
| 563829 | 1 | < 0.1% |
| 558565 | 1 | < 0.1% |
| 520407 | 1 | < 0.1% |
| 479439 | 1 | < 0.1% |
| 416750 | 1 | < 0.1% |
| 405114 | 1 | < 0.1% |
| 382919 | 1 | < 0.1% |
Type of Area
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.0 MiB |
| Strata | 107837 |
|---|---|
| Land | 8310 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.856905473 |
| Min length | 4 |
Characters and Unicode
| Total characters | 680262 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Strata |
|---|---|
| 2nd row | Strata |
| 3rd row | Strata |
| 4th row | Strata |
| 5th row | Strata |
| Value | Count | Frequency (%) |
| Strata | 107837 | 92.8% |
| Land | 8310 | 7.2% |
| Value | Count | Frequency (%) |
| strata | 107837 | 92.8% |
| land | 8310 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 223984 | |
| t | 215674 | |
| S | 107837 | |
| r | 107837 | |
| L | 8310 | 1.2% |
| n | 8310 | 1.2% |
| d | 8310 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 564115 | |
| Uppercase Letter | 116147 | 17.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 223984 | |
| t | 215674 | |
| r | 107837 | |
| n | 8310 | 1.5% |
| d | 8310 | 1.5% |
| Value | Count | Frequency (%) |
| S | 107837 | |
| L | 8310 | 7.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 680262 |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 223984 | |
| t | 215674 | |
| S | 107837 | |
| r | 107837 | |
| L | 8310 | 1.2% |
| n | 8310 | 1.2% |
| d | 8310 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 680262 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 223984 | |
| t | 215674 | |
| S | 107837 | |
| r | 107837 | |
| L | 8310 | 1.2% |
| n | 8310 | 1.2% |
| d | 8310 | 1.2% |
Floor Level
Categorical
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.1 MiB |
| 01 to 05 | 37908 |
|---|---|
| 06 to 10 | 28128 |
| 11 to 15 | 20111 |
| - | 9913 |
| 16 to 20 | 9475 |
| Other values (12) | 10612 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.402558826 |
| Min length | 1 |
Characters and Unicode
| Total characters | 859785 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 01 to 05 |
|---|---|
| 2nd row | 11 to 15 |
| 3rd row | 11 to 15 |
| 4th row | 16 to 20 |
| 5th row | 16 to 20 |
| Value | Count | Frequency (%) |
| 01 to 05 | 37908 | 32.6% |
| 06 to 10 | 28128 | 24.2% |
| 11 to 15 | 20111 | 17.3% |
| - | 9913 | 8.5% |
| 16 to 20 | 9475 | 8.2% |
| 21 to 25 | 4422 | 3.8% |
| 26 to 30 | 2714 | 2.3% |
| 31 to 35 | 1942 | 1.7% |
| 36 to 40 | 933 | 0.8% |
| 41 to 45 | 319 | 0.3% |
| Other values (7) | 282 | 0.2% |
| Value | Count | Frequency (%) |
| to | 106234 | |
| 01 | 37908 | 11.5% |
| 05 | 37908 | 11.5% |
| 10 | 28128 | 8.6% |
| 06 | 28128 | 8.6% |
| 15 | 20111 | 6.1% |
| 11 | 20111 | 6.1% |
| 9913 | 3.0% | |
| 20 | 9475 | 2.9% |
| 16 | 9475 | 2.9% |
| Other values (24) | 21224 | 6.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 212468 | ||
| 0 | 145373 | |
| 1 | 142630 | |
| t | 106234 | |
| o | 106234 | |
| 5 | 65085 | 7.6% |
| 6 | 41535 | 4.8% |
| 2 | 21033 | 2.4% |
| - | 9913 | 1.2% |
| 3 | 7531 | 0.9% |
| Other values (3) | 1749 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 424910 | |
| Space Separator | 212468 | |
| Lowercase Letter | 212468 | |
| Dash Punctuation | 9913 | 1.2% |
| Uppercase Letter | 26 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 145373 | |
| 1 | 142630 | |
| 5 | 65085 | |
| 6 | 41535 | 9.8% |
| 2 | 21033 | 4.9% |
| 3 | 7531 | 1.8% |
| 4 | 1694 | 0.4% |
| 7 | 29 | < 0.1% |
| Value | Count | Frequency (%) |
| t | 106234 | |
| o | 106234 |
| Value | Count | Frequency (%) |
| 212468 |
| Value | Count | Frequency (%) |
| - | 9913 |
| Value | Count | Frequency (%) |
| B | 26 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 647291 | |
| Latin | 212494 | 24.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 212468 | ||
| 0 | 145373 | |
| 1 | 142630 | |
| 5 | 65085 | 10.1% |
| 6 | 41535 | 6.4% |
| 2 | 21033 | 3.2% |
| - | 9913 | 1.5% |
| 3 | 7531 | 1.2% |
| 4 | 1694 | 0.3% |
| 7 | 29 | < 0.1% |
| Value | Count | Frequency (%) |
| t | 106234 | |
| o | 106234 | |
| B | 26 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 859785 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 212468 | ||
| 0 | 145373 | |
| 1 | 142630 | |
| t | 106234 | |
| o | 106234 | |
| 5 | 65085 | 7.6% |
| 6 | 41535 | 4.8% |
| 2 | 21033 | 2.4% |
| - | 9913 | 1.2% |
| 3 | 7531 | 0.9% |
| Other values (3) | 1749 | 0.2% |
Unit Price ($psf)
Real number (ℝ≥0)
| Distinct | 3346 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1395.469044 |
|---|---|
| Minimum | 33 |
| Maximum | 5896 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 742 |
| Q1 | 1011 |
| median | 1339 |
| Q3 | 1670 |
| 95-th percentile | 2399 |
| Maximum | 5896 |
| Range | 5863 |
| Interquartile range (IQR) | 659 |
Descriptive statistics
| Standard deviation | 519.7054367 |
|---|---|
| Coefficient of variation (CV) | 0.3724234795 |
| Kurtosis | 2.395190266 |
| Mean | 1395.469044 |
| Median Absolute Deviation (MAD) | 329 |
| Skewness | 1.161015815 |
| Sum | 162079543 |
| Variance | 270093.7409 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 929 | 328 | 0.3% |
| 1394 | 257 | 0.2% |
| 1548 | 227 | 0.2% |
| 1161 | 207 | 0.2% |
| 1327 | 175 | 0.2% |
| 1239 | 173 | 0.1% |
| 1486 | 165 | 0.1% |
| 1858 | 163 | 0.1% |
| 1301 | 162 | 0.1% |
| 1000 | 161 | 0.1% |
| Other values (3336) | 114129 | 98.3% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 59 | 1 | < 0.1% |
| 69 | 1 | < 0.1% |
| 100 | 1 | < 0.1% |
| 109 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 127 | 1 | < 0.1% |
| 130 | 1 | < 0.1% |
| 135 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5896 | 1 | < 0.1% |
| 5633 | 1 | < 0.1% |
| 5305 | 1 | < 0.1% |
| 5125 | 1 | < 0.1% |
| 5050 | 1 | < 0.1% |
| 4987 | 1 | < 0.1% |
| 4936 | 1 | < 0.1% |
| 4927 | 1 | < 0.1% |
| 4913 | 1 | < 0.1% |
| 4899 | 1 | < 0.1% |
tenure_yrs_clean
Real number (ℝ≥0)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 346.7920064 |
|---|---|
| Minimum | 60 |
| Maximum | 900 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 60 |
|---|---|
| 5-th percentile | 99 |
| Q1 | 99 |
| median | 99 |
| Q3 | 900 |
| 95-th percentile | 900 |
| Maximum | 900 |
| Range | 840 |
| Interquartile range (IQR) | 801 |
Descriptive statistics
| Standard deviation | 370.2756669 |
|---|---|
| Coefficient of variation (CV) | 1.067716845 |
| Kurtosis | -1.319813126 |
| Mean | 346.7920064 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8247143536 |
| Sum | 40277464 |
| Variance | 137104.0695 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99 | 79663 | 68.6% |
| 900 | 35933 | 30.9% |
| 103 | 236 | 0.2% |
| 60 | 94 | 0.1% |
| 102 | 70 | 0.1% |
| 100 | 57 | < 0.1% |
| 70 | 26 | < 0.1% |
| 110 | 24 | < 0.1% |
| 101 | 22 | < 0.1% |
| 85 | 8 | < 0.1% |
| Other values (4) | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 60 | 94 | 0.1% |
| 70 | 26 | < 0.1% |
| 85 | 8 | < 0.1% |
| 89 | 3 | < 0.1% |
| 93 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 99 | 79663 | 68.6% |
| 100 | 57 | < 0.1% |
| 101 | 22 | < 0.1% |
| 102 | 70 | 0.1% |
| Value | Count | Frequency (%) |
| 900 | 35933 | 30.9% |
| 110 | 24 | < 0.1% |
| 104 | 5 | < 0.1% |
| 103 | 236 | 0.2% |
| 102 | 70 | 0.1% |
| 101 | 22 | < 0.1% |
| 100 | 57 | < 0.1% |
| 99 | 79663 | 68.6% |
| 97 | 1 | < 0.1% |
| 93 | 1 | < 0.1% |
lease_commencement
Real number (ℝ≥0)
| Distinct | 90 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 32926 |
| Missing (%) | 28.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2004.87246 |
|---|---|
| Minimum | 1827 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 1827 |
|---|---|
| 5-th percentile | 1956 |
| Q1 | 2007 |
| median | 2014 |
| Q3 | 2017 |
| 95-th percentile | 2018 |
| Maximum | 2020 |
| Range | 193 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 28.26883727 |
|---|---|
| Coefficient of variation (CV) | 0.01410006763 |
| Kurtosis | 14.64618096 |
| Mean | 2004.87246 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -3.842975042 |
| Sum | 166847491 |
| Variance | 799.1271604 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2018 | 13139 | 11.3% |
| 2014 | 8043 | 6.9% |
| 2015 | 7360 | 6.3% |
| 2016 | 6049 | 5.2% |
| 2017 | 5054 | 4.4% |
| 2013 | 5026 | 4.3% |
| 2011 | 4311 | 3.7% |
| 2012 | 3753 | 3.2% |
| 2010 | 3383 | 2.9% |
| 2019 | 3113 | 2.7% |
| Other values (80) | 23990 | 20.7% |
| (Missing) | 32926 | 28.3% |
| Value | Count | Frequency (%) |
| 1827 | 34 | < 0.1% |
| 1835 | 2 | < 0.1% |
| 1841 | 201 | 0.2% |
| 1874 | 29 | < 0.1% |
| 1875 | 154 | 0.1% |
| 1876 | 190 | 0.2% |
| 1877 | 470 | 0.4% |
| 1878 | 185 | 0.2% |
| 1879 | 458 | 0.4% |
| 1881 | 27 | < 0.1% |
| Value | Count | Frequency (%) |
| 2020 | 117 | 0.1% |
| 2019 | 3113 | 2.7% |
| 2018 | 13139 | 11.3% |
| 2017 | 5054 | 4.4% |
| 2016 | 6049 | 5.2% |
| 2015 | 7360 | 6.3% |
| 2014 | 8043 | 6.9% |
| 2013 | 5026 | 4.3% |
| 2012 | 3753 | 3.2% |
| 2011 | 4311 | 3.7% |
sale_yr
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2018.059692 |
|---|---|
| Minimum | 2016 |
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 2016 |
|---|---|
| 5-th percentile | 2016 |
| Q1 | 2017 |
| median | 2018 |
| Q3 | 2019 |
| 95-th percentile | 2020 |
| Maximum | 2021 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.445587991 |
|---|---|
| Coefficient of variation (CV) | 0.0007163256852 |
| Kurtosis | -1.1221707 |
| Mean | 2018.059692 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1665534896 |
| Sum | 234391579 |
| Variance | 2.089724641 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2017 | 28366 | 24.4% |
| 2018 | 23442 | 20.2% |
| 2020 | 22423 | 19.3% |
| 2019 | 19480 | 16.8% |
| 2016 | 19267 | 16.6% |
| 2021 | 3169 | 2.7% |
| Value | Count | Frequency (%) |
| 2016 | 19267 | 16.6% |
| 2017 | 28366 | 24.4% |
| 2018 | 23442 | 20.2% |
| 2019 | 19480 | 16.8% |
| 2020 | 22423 | 19.3% |
| 2021 | 3169 | 2.7% |
| Value | Count | Frequency (%) |
| 2021 | 3169 | 2.7% |
| 2020 | 22423 | 19.3% |
| 2019 | 19480 | 16.8% |
| 2018 | 23442 | 20.2% |
| 2017 | 28366 | 24.4% |
| 2016 | 19267 | 16.6% |
remaining_lease
Real number (ℝ≥0)
| Distinct | 168 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 32926 |
| Missing (%) | 28.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.9678807 |
|---|---|
| Minimum | 3 |
| Maximum | 900 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 75 |
| Q1 | 90 |
| median | 96 |
| Q3 | 98 |
| 95-th percentile | 723 |
| Maximum | 900 |
| Range | 897 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 151.1697154 |
|---|---|
| Coefficient of variation (CV) | 1.190613835 |
| Kurtosis | 14.59101237 |
| Mean | 126.9678807 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 4.058403998 |
| Sum | 10566394 |
| Variance | 22852.28285 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98 | 16218 | 14.0% |
| 97 | 14104 | 12.1% |
| 96 | 9052 | 7.8% |
| 95 | 4327 | 3.7% |
| 99 | 3870 | 3.3% |
| 90 | 2688 | 2.3% |
| 91 | 2619 | 2.3% |
| 93 | 2614 | 2.3% |
| 92 | 2514 | 2.2% |
| 94 | 1967 | 1.7% |
| Other values (158) | 23248 | 20.0% |
| (Missing) | 32926 | 28.3% |
| Value | Count | Frequency (%) |
| 3 | 3 | < 0.1% |
| 4 | 1 | < 0.1% |
| 14 | 7 | < 0.1% |
| 15 | 6 | < 0.1% |
| 16 | 5 | < 0.1% |
| 17 | 4 | < 0.1% |
| 18 | 4 | < 0.1% |
| 21 | 1 | < 0.1% |
| 26 | 2 | < 0.1% |
| 27 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 900 | 1 | < 0.1% |
| 895 | 1 | < 0.1% |
| 885 | 1 | < 0.1% |
| 879 | 7 | < 0.1% |
| 878 | 13 | < 0.1% |
| 877 | 12 | < 0.1% |
| 876 | 5 | < 0.1% |
| 875 | 4 | < 0.1% |
| 874 | 4 | < 0.1% |
| 873 | 4 | < 0.1% |
nearest_mrt
Categorical
| Distinct | 137 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 456 |
| Missing (%) | 0.4% |
| Memory size | 8.6 MiB |
| Clementi MRT Station | 5650 |
|---|---|
| Potong Pasir MRT Station | 4714 |
| Kovan MRT Station | 3725 |
| Hougang MRT Station | 2896 |
| Tampines West MRT Station | 2759 |
| Other values (132) |
Length
| Max length | 29 |
|---|---|
| Median length | 21 |
| Mean length | 20.90818646 |
| Min length | 16 |
Characters and Unicode
| Total characters | 2418889 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Tampines West MRT Station |
|---|---|
| 2nd row | Bedok Reservoir MRT Station |
| 3rd row | Tampines West MRT Station |
| 4th row | Bedok Reservoir MRT Station |
| 5th row | Bedok Reservoir MRT Station |
| Value | Count | Frequency (%) |
| Clementi MRT Station | 5650 | 4.9% |
| Potong Pasir MRT Station | 4714 | 4.1% |
| Kovan MRT Station | 3725 | 3.2% |
| Hougang MRT Station | 2896 | 2.5% |
| Tampines West MRT Station | 2759 | 2.4% |
| Queenstown MRT Station | 2717 | 2.3% |
| Sembawang MRT Station | 2636 | 2.3% |
| Bedok MRT Station | 2506 | 2.2% |
| Dakota MRT Station | 2489 | 2.1% |
| Eunos MRT Station | 2411 | 2.1% |
| Other values (127) | 83188 | 71.6% |
| Value | Count | Frequency (%) |
| station | 115691 | |
| mrt | 102796 | |
| lrt | 12895 | 3.3% |
| pasir | 7614 | 1.9% |
| clementi | 5650 | 1.4% |
| potong | 4714 | 1.2% |
| tampines | 4527 | 1.1% |
| kovan | 3725 | 0.9% |
| bedok | 3265 | 0.8% |
| park | 3005 | 0.8% |
| Other values (169) | 131916 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 280782 | |
| 280107 | ||
| a | 223906 | 9.3% |
| n | 203827 | 8.4% |
| o | 189892 | 7.9% |
| i | 173679 | 7.2% |
| T | 129513 | 5.4% |
| S | 127736 | 5.3% |
| R | 122551 | 5.1% |
| M | 108270 | 4.5% |
| Other values (39) | 578626 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1510768 | |
| Uppercase Letter | 627841 | |
| Space Separator | 280107 | 11.6% |
| Dash Punctuation | 173 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| T | 129513 | |
| S | 127736 | |
| R | 122551 | |
| M | 108270 | |
| P | 21127 | 3.4% |
| L | 19314 | 3.1% |
| K | 17436 | 2.8% |
| B | 15679 | 2.5% |
| C | 13313 | 2.1% |
| H | 8311 | 1.3% |
| Other values (14) | 44591 | 7.1% |
| Value | Count | Frequency (%) |
| t | 280782 | |
| a | 223906 | |
| n | 203827 | |
| o | 189892 | |
| i | 173679 | |
| e | 86761 | 5.7% |
| r | 52944 | 3.5% |
| g | 43926 | 2.9% |
| s | 33607 | 2.2% |
| l | 32243 | 2.1% |
| Other values (13) | 189201 |
| Value | Count | Frequency (%) |
| 280107 |
| Value | Count | Frequency (%) |
| - | 173 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2138609 | |
| Common | 280280 | 11.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 280782 | |
| a | 223906 | |
| n | 203827 | |
| o | 189892 | 8.9% |
| i | 173679 | 8.1% |
| T | 129513 | 6.1% |
| S | 127736 | 6.0% |
| R | 122551 | 5.7% |
| M | 108270 | 5.1% |
| e | 86761 | 4.1% |
| Other values (37) | 491692 |
| Value | Count | Frequency (%) |
| 280107 | ||
| - | 173 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2418889 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 280782 | |
| 280107 | ||
| a | 223906 | 9.3% |
| n | 203827 | 8.4% |
| o | 189892 | 7.9% |
| i | 173679 | 7.2% |
| T | 129513 | 5.4% |
| S | 127736 | 5.3% |
| R | 122551 | 5.1% |
| M | 108270 | 4.5% |
| Other values (39) | 578626 |
nearest_mrt_dist
Real number (ℝ≥0)
| Distinct | 2809 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 456 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 732.1908643 |
|---|---|
| Minimum | 12.31416907 |
| Maximum | 3846.761956 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 12.31416907 |
|---|---|
| 5-th percentile | 125.4554316 |
| Q1 | 356.9170077 |
| median | 643.1972738 |
| Q3 | 981.5382877 |
| 95-th percentile | 1679.890005 |
| Maximum | 3846.761956 |
| Range | 3834.447787 |
| Interquartile range (IQR) | 624.62128 |
Descriptive statistics
| Standard deviation | 490.0188848 |
|---|---|
| Coefficient of variation (CV) | 0.6692502033 |
| Kurtosis | 0.8742301879 |
| Mean | 732.1908643 |
| Median Absolute Deviation (MAD) | 297.4412712 |
| Skewness | 1.005165477 |
| Sum | 84707893.28 |
| Variance | 240118.5074 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 717.8234658 | 3390 | 2.9% |
| 645.7591481 | 1715 | 1.5% |
| 159.9025001 | 1383 | 1.2% |
| 856.8236242 | 1375 | 1.2% |
| 478.5646644 | 1202 | 1.0% |
| 477.3735502 | 1125 | 1.0% |
| 807.2202822 | 1078 | 0.9% |
| 743.2810752 | 967 | 0.8% |
| 262.9989849 | 966 | 0.8% |
| 1403.759201 | 879 | 0.8% |
| Other values (2799) | 101611 | 87.5% |
| Value | Count | Frequency (%) |
| 12.31416907 | 6 | < 0.1% |
| 37.29543873 | 1 | < 0.1% |
| 45.00549768 | 1 | < 0.1% |
| 53.52680652 | 45 | < 0.1% |
| 60.66151122 | 404 | 0.3% |
| 63.48853814 | 8 | < 0.1% |
| 66.97881847 | 4 | < 0.1% |
| 68.27312857 | 37 | < 0.1% |
| 71.12167461 | 4 | < 0.1% |
| 75.59524915 | 72 | 0.1% |
| Value | Count | Frequency (%) |
| 3846.761956 | 3 | < 0.1% |
| 3373.035761 | 15 | < 0.1% |
| 3308.067679 | 34 | < 0.1% |
| 3292.598781 | 47 | < 0.1% |
| 3124.73284 | 2 | < 0.1% |
| 3103.012649 | 23 | < 0.1% |
| 3051.261891 | 13 | < 0.1% |
| 3021.514002 | 2 | < 0.1% |
| 2994.114799 | 32 | < 0.1% |
| 2894.895812 | 1 | < 0.1% |
nearest_mall
Categorical
| Distinct | 137 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 456 |
| Missing (%) | 0.4% |
| Memory size | 7.9 MiB |
| The Poiz | 7200 |
|---|---|
| The Clementi Mall | 4218 |
| Tiong Bahru Plaza | 3145 |
| Anchorpoint | 3137 |
| Our Tampines Hub | 2902 |
| Other values (132) |
Length
| Max length | 31 |
|---|---|
| Median length | 14 |
| Mean length | 14.0438755 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1624750 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Our Tampines Hub |
|---|---|
| 2nd row | East Village |
| 3rd row | Our Tampines Hub |
| 4th row | East Village |
| 5th row | East Village |
| Value | Count | Frequency (%) |
| The Poiz | 7200 | 6.2% |
| The Clementi Mall | 4218 | 3.6% |
| Tiong Bahru Plaza | 3145 | 2.7% |
| Anchorpoint | 3137 | 2.7% |
| Our Tampines Hub | 2902 | 2.5% |
| Eastpoint Mall | 2703 | 2.3% |
| Heartland Mall | 2412 | 2.1% |
| The Seletar Mall | 2292 | 2.0% |
| KINEX | 2288 | 2.0% |
| Sembawang Shopping Centre | 2197 | 1.9% |
| Other values (127) | 83197 |
| Value | Count | Frequency (%) |
| mall | 26397 | 9.9% |
| the | 18332 | 6.9% |
| plaza | 14789 | 5.6% |
| centre | 13452 | 5.1% |
| shopping | 11335 | 4.3% |
| poiz | 7200 | 2.7% |
| square | 5173 | 1.9% |
| clementi | 4843 | 1.8% |
| serangoon | 4459 | 1.7% |
| point | 4302 | 1.6% |
| Other values (160) | 155102 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 163430 | 10.1% |
| (space) | 149693 | 9.2% |
| e | 136998 | 8.4% |
| l | 118024 | 7.3% |
| n | 111806 | 6.9% |
| o | 84681 | 5.2% |
| i | 81936 | 5.0% |
| t | 75327 | 4.6% |
| r | 73449 | 4.5% |
| h | 48064 | 3.0% |
| Other values (52) | 581342 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1179927 | |
| Uppercase Letter | 280300 | 17.3% |
| Space Separator | 149693 | 9.2% |
| Decimal Number | 13965 | 0.9% |
| Other Punctuation | 862 | 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 36796 | |
| T | 32207 | |
| M | 31130 | |
| S | 30084 | |
| C | 28802 | |
| H | 14721 | 5.3% |
| B | 12993 | 4.6% |
| W | 11221 | 4.0% |
| E | 10926 | 3.9% |
| V | 10692 | 3.8% |
| Other values (16) | 60728 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 163430 | |
| e | 136998 | |
| l | 118024 | |
| n | 111806 | |
| o | 84681 | 7.2% |
| i | 81936 | 6.9% |
| t | 75327 | 6.4% |
| r | 73449 | 6.2% |
| h | 48064 | 4.1% |
| g | 44584 | 3.8% |
| Other values (16) | 241628 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5601 | |
| 2 | 4109 | |
| 0 | 2039 | 14.6% |
| 8 | 1108 | 7.9% |
| 3 | 625 | 4.5% |
| 9 | 483 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 674 | |
| . | 188 | 21.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 149693 | |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1460227 | |
| Common | 164523 | 10.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 163430 | 11.2% |
| e | 136998 | 9.4% |
| l | 118024 | 8.1% |
| n | 111806 | 7.7% |
| o | 84681 | 5.8% |
| i | 81936 | 5.6% |
| t | 75327 | 5.2% |
| r | 73449 | 5.0% |
| h | 48064 | 3.3% |
| g | 44584 | 3.1% |
| Other values (42) | 521928 |
Common
| Value | Count | Frequency (%) |
| (space) | 149693 | |
| 1 | 5601 | 3.4% |
| 2 | 4109 | 2.5% |
| 0 | 2039 | 1.2% |
| 8 | 1108 | 0.7% |
| ' | 674 | 0.4% |
| 3 | 625 | 0.4% |
| 9 | 483 | 0.3% |
| . | 188 | 0.1% |
| + | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1624750 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 163430 | 10.1% |
| (space) | 149693 | 9.2% |
| e | 136998 | 8.4% |
| l | 118024 | 7.3% |
| n | 111806 | 6.9% |
| o | 84681 | 5.2% |
| i | 81936 | 5.0% |
| t | 75327 | 4.6% |
| r | 73449 | 4.5% |
| h | 48064 | 3.0% |
| Other values (52) | 581342 |
nearest_mall_dist
Real number (ℝ≥0)
| Distinct | 2803 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 456 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 780.4680342 |
|---|---|
| Minimum | 0 |
| Maximum | 3456.159897 |
| Zeros | 1089 |
| Zeros (%) | 0.9% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 131.0174774 |
| Q1 | 452.9956561 |
| median | 711.64741 |
| Q3 | 1025.084616 |
| 95-th percentile | 1627.46941 |
| Maximum | 3456.159897 |
| Range | 3456.159897 |
| Interquartile range (IQR) | 572.08896 |
Descriptive statistics
| Standard deviation | 476.1707895 |
|---|---|
| Coefficient of variation (CV) | 0.6101092788 |
| Kurtosis | 2.025828171 |
| Mean | 780.4680342 |
| Median Absolute Deviation (MAD) | 287.465676 |
| Skewness | 1.044519764 |
| Sum | 90293127.35 |
| Variance | 226738.6208 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 947.6915038 | 3390 | 2.9% |
| 567.0746204 | 1715 | 1.5% |
| 988.7172262 | 1383 | 1.2% |
| 543.0615692 | 1375 | 1.2% |
| 270.5905967 | 1202 | 1.0% |
| 876.335862 | 1125 | 1.0% |
| 0 | 1089 | 0.9% |
| 650.5969117 | 1078 | 0.9% |
| 874.2244184 | 967 | 0.8% |
| 491.6662136 | 966 | 0.8% |
| Other values (2793) | 101401 |
| Value | Count | Frequency (%) |
| 0 | 1089 | |
| 2.204837952 × 10⁻¹¹ | 13 | < 0.1% |
| 1.581505939 × 10⁻⁹ | 2 | < 0.1% |
| 1.581531395 × 10⁻⁹ | 2 | < 0.1% |
| 1.581537041 × 10⁻⁹ | 80 | 0.1% |
| 1.581539195 × 10⁻⁹ | 18 | < 0.1% |
| 1.581539426 × 10⁻⁹ | 10 | < 0.1% |
| 1.581541594 × 10⁻⁹ | 17 | < 0.1% |
| 1.581596948 × 10⁻⁹ | 9 | < 0.1% |
| 2.605476064 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 3456.159897 | 5 | < 0.1% |
| 3303.210357 | 3 | < 0.1% |
| 3278.90118 | 15 | < 0.1% |
| 3258.8172 | 47 | |
| 3239.772084 | 34 | |
| 3210.928366 | 3 | < 0.1% |
| 3124.481215 | 4 | < 0.1% |
| 3124.376689 | 75 | |
| 3096.245167 | 35 | |
| 3067.903789 | 23 | < 0.1% |
nearest_cbd_dist
Real number (ℝ≥0)
| Distinct | 2809 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 456 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9347.902679 |
|---|---|
| Minimum | 121.0838467 |
| Maximum | 20416.51831 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 907.5 KiB |
Quantile statistics
| Minimum | 121.0838467 |
|---|---|
| 5-th percentile | 2356.218549 |
| Q1 | 5624.75336 |
| median | 9541.864938 |
| Q3 | 12914.5806 |
| 95-th percentile | 17043.35729 |
| Maximum | 20416.51831 |
| Range | 20295.43446 |
| Interquartile range (IQR) | 7289.827239 |
Descriptive statistics
| Standard deviation | 4572.110706 |
|---|---|
| Coefficient of variation (CV) | 0.4891055099 |
| Kurtosis | -0.9160661678 |
| Mean | 9347.902679 |
| Median Absolute Deviation (MAD) | 3504.116114 |
| Skewness | 0.1316024064 |
| Sum | 1081468209 |
| Variance | 20904196.31 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6102.667363 | 3390 | 2.9% |
| 12914.5806 | 1715 | 1.5% |
| 7022.694828 | 1383 | 1.2% |
| 11229.74826 | 1375 | 1.2% |
| 5328.887155 | 1202 | 1.0% |
| 7947.968625 | 1125 | 1.0% |
| 11031.87649 | 1078 | 0.9% |
| 10309.86348 | 967 | 0.8% |
| 15441.38343 | 966 | 0.8% |
| 9655.259237 | 879 | 0.8% |
| Other values (2799) | 101611 |
| Value | Count | Frequency (%) |
| 121.0838467 | 8 | < 0.1% |
| 144.88039 | 185 | 0.2% |
| 179.0365628 | 8 | < 0.1% |
| 227.4113504 | 78 | 0.1% |
| 432.1329599 | 47 | < 0.1% |
| 494.5950567 | 56 | < 0.1% |
| 503.6405818 | 516 | |
| 514.7186173 | 109 | 0.1% |
| 581.1859468 | 27 | < 0.1% |
| 696.979183 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 20416.51831 | 4 | < 0.1% |
| 19970.30264 | 79 | |
| 19962.5822 | 7 | < 0.1% |
| 19880.20896 | 2 | < 0.1% |
| 19859.49574 | 2 | < 0.1% |
| 19833.8367 | 2 | < 0.1% |
| 19513.23349 | 9 | < 0.1% |
| 19486.90189 | 1 | < 0.1% |
| 19437.65674 | 9 | < 0.1% |
| 19421.84705 | 120 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, which implies that for a linear relationship the steepness of the line, that is, its angle to the x-axis, does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
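As a minimal sketch of the calculation described above (not the report's own implementation), the snippet below computes r as covariance divided by the product of standard deviations. The function name `pearson_r` is hypothetical, and the sample values are the first five `Area (Sqft)` and `Price ($)` entries from the First rows table, used purely for illustration.

```python
import numpy as np

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Population covariance and population standard deviations (ddof=0);
    # the normalisation cancels, so the sample versions give the same r.
    cov = np.mean((x - x.mean()) * (y - y.mean()))
    return cov / (x.std() * y.std())

# Illustrative values: first five rows of Area (Sqft) and Price ($).
area = np.array([1173, 1227, 958, 1227, 2099], dtype=float)
price = np.array([1060000, 1285000, 900000, 1280000, 1580000], dtype=float)

r = pearson_r(area, price)

# Changing location and (positive) scale of either variable leaves r unchanged.
r_rescaled = pearson_r(area * 0.0929 + 10.0, price / 1000.0)
```

On a full DataFrame, `DataFrame.corr(method="pearson")` in pandas computes the same quantity for every pair of numeric columns.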
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
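The rank-based calculation can be sketched as follows. This is an illustrative example under stated assumptions, not the report's code: `spearman_rho` is a hypothetical helper, ties receive average ranks via `pandas.Series.rank`, and the data are synthetic, chosen to be monotonic but strongly nonlinear.

```python
import numpy as np
import pandas as pd

def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    rx = pd.Series(x).rank()  # average ranks for ties (pandas default)
    ry = pd.Series(y).rank()
    cov = ((rx - rx.mean()) * (ry - ry.mean())).mean()
    return cov / (rx.std(ddof=0) * ry.std(ddof=0))

# Synthetic data: y increases monotonically with x, but not linearly.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.exp(x)

rho = spearman_rho(x, y)      # ranks agree perfectly
r = np.corrcoef(x, y)[0, 1]   # Pearson's r is pulled down by the curvature
```

Because the ranks of `x` and `y` coincide, ρ reaches its maximum while Pearson's r stays noticeably below it, which is the behaviour the paragraph above describes.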
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
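The pair-counting definition above can be illustrated with a short sketch. It implements the plain formula as stated (often called τ-a), with no adjustment for ties; library implementations commonly use a tie-adjusted variant, so results can differ when the data contain ties. The function name `kendall_tau_a` and the sample data are hypothetical.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, without any tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y move in the same direction,
        # discordant when they move in opposite directions; ties count as neither.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Synthetic data: two adjacent swaps relative to perfect agreement.
x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]

tau = kendall_tau_a(x, y)  # 8 concordant, 2 discordant, 10 pairs
```

This brute-force version examines every pair, so its cost grows quadratically with the number of observations; it is meant to show the definition, not to be run on all 116147 rows.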
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. There is extensive documentation available here.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013, which can be found here.
First rows
| | Type | Postal District | Market Segment | Type of Sale | Price ($) | Area (Sqft) | Type of Area | Floor Level | Unit Price ($psf) | tenure_yrs_clean | lease_commencement | sale_yr | remaining_lease | nearest_mrt | nearest_mrt_dist | nearest_mall | nearest_mall_dist | nearest_cbd_dist |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Executive Condominium | 18 | OCR | Resale | 1060000 | 1173 | Strata | 01 to 05 | 903 | 99.0 | 2011.0 | 2021 | 89.0 | Tampines West MRT Station | 1351.020868 | Our Tampines Hub | 1377.881503 | 11514.460859 |
| 1 | Condominium | 16 | OCR | Resale | 1285000 | 1227 | Strata | 11 to 15 | 1047 | 99.0 | 1996.0 | 2021 | 74.0 | Bedok Reservoir MRT Station | 263.129128 | East Village | 1323.301689 | 11255.388399 |
| 2 | Executive Condominium | 18 | OCR | Resale | 900000 | 958 | Strata | 11 to 15 | 939 | 99.0 | 2011.0 | 2021 | 89.0 | Tampines West MRT Station | 1351.020868 | Our Tampines Hub | 1377.881503 | 11514.460859 |
| 3 | Condominium | 16 | OCR | Resale | 1280000 | 1227 | Strata | 16 to 20 | 1043 | 99.0 | 1996.0 | 2021 | 74.0 | Bedok Reservoir MRT Station | 263.129128 | East Village | 1323.301689 | 11255.388399 |
| 4 | Condominium | 16 | OCR | Resale | 1580000 | 2099 | Strata | 16 to 20 | 753 | 99.0 | 1996.0 | 2021 | 74.0 | Bedok Reservoir MRT Station | 263.129128 | East Village | 1323.301689 | 11255.388399 |
| 5 | Executive Condominium | 18 | OCR | Resale | 850000 | 958 | Strata | 01 to 05 | 887 | 99.0 | 2011.0 | 2021 | 89.0 | Tampines West MRT Station | 1351.020868 | Our Tampines Hub | 1377.881503 | 11514.460859 |
| 6 | Executive Condominium | 18 | OCR | Resale | 1088000 | 1130 | Strata | 11 to 15 | 963 | 99.0 | 2011.0 | 2021 | 89.0 | Tampines West MRT Station | 1351.020868 | Our Tampines Hub | 1377.881503 | 11514.460859 |
| 7 | Condominium | 16 | OCR | Resale | 850000 | 893 | Strata | 06 to 10 | 951 | 99.0 | 1996.0 | 2021 | 74.0 | Bedok Reservoir MRT Station | 263.129128 | East Village | 1323.301689 | 11255.388399 |
| 8 | Condominium | 16 | OCR | Resale | 678000 | 527 | Strata | 01 to 05 | 1285 | 99.0 | 2011.0 | 2021 | 89.0 | Bedok North MRT Station | 466.298206 | Djitsun Mall | 1613.952313 | 9911.264731 |
| 9 | Condominium | 16 | OCR | Resale | 1070000 | 1206 | Strata | 11 to 15 | 888 | 99.0 | 1996.0 | 2021 | 74.0 | Bedok Reservoir MRT Station | 263.129128 | East Village | 1323.301689 | 11255.388399 |
Last rows
| | Type | Postal District | Market Segment | Type of Sale | Price ($) | Area (Sqft) | Type of Area | Floor Level | Unit Price ($psf) | tenure_yrs_clean | lease_commencement | sale_yr | remaining_lease | nearest_mrt | nearest_mrt_dist | nearest_mall | nearest_mall_dist | nearest_cbd_dist |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 116137 | Apartment | 23 | OCR | Resale | 980000 | 1302 | Strata | 16 to 20 | 752 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116138 | Apartment | 23 | OCR | Resale | 750000 | 915 | Strata | 06 to 10 | 820 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116139 | Apartment | 23 | OCR | Resale | 750000 | 904 | Strata | 16 to 20 | 829 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116140 | Apartment | 23 | OCR | Resale | 760000 | 915 | Strata | 06 to 10 | 831 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116141 | Apartment | 23 | OCR | Resale | 700000 | 807 | Strata | 11 to 15 | 867 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116142 | Apartment | 23 | OCR | Resale | 735000 | 818 | Strata | 06 to 10 | 898 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116143 | Apartment | 23 | OCR | Resale | 785000 | 915 | Strata | 11 to 15 | 858 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116144 | Apartment | 23 | OCR | Resale | 720000 | 818 | Strata | 06 to 10 | 880 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116145 | Apartment | 23 | OCR | Resale | 900000 | 1313 | Strata | 06 to 10 | 685 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
| 116146 | Apartment | 23 | OCR | Resale | 805000 | 915 | Strata | 16 to 20 | 880 | 99.0 | 1994.0 | 2016 | 77.0 | Bukit Panjang LRT Station | 212.964958 | Hillion Mall | 265.365483 | 14337.480919 |
Most frequent
| | Type | Postal District | Market Segment | Type of Sale | Price ($) | Area (Sqft) | Type of Area | Floor Level | Unit Price ($psf) | tenure_yrs_clean | lease_commencement | sale_yr | remaining_lease | nearest_mrt | nearest_mrt_dist | nearest_mall | nearest_mall_dist | nearest_cbd_dist | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 701 | Apartment | 13 | RCR | New Sale | 1796000 | 958 | Strata | 06 to 10 | 1875 | 99.0 | 2017.0 | 2020 | 96.0 | Woodleigh MRT Station | 103.796837 | The Poiz | 908.624530 | 6715.328380 | 20 |
| 2045 | Condominium | 5 | OCR | New Sale | 550000 | 463 | Strata | 11 to 15 | 1188 | 99.0 | 2015.0 | 2016 | 98.0 | Clementi MRT Station | 1543.726335 | The Clementi Mall | 1376.926421 | 11759.623548 | 19 |
| 2065 | Condominium | 5 | OCR | New Sale | 725000 | 603 | Strata | 11 to 15 | 1203 | 99.0 | 2015.0 | 2016 | 98.0 | Clementi MRT Station | 1543.726335 | The Clementi Mall | 1376.926421 | 11759.623548 | 18 |
| 2043 | Condominium | 5 | OCR | New Sale | 550000 | 463 | Strata | 06 to 10 | 1188 | 99.0 | 2015.0 | 2016 | 98.0 | Clementi MRT Station | 1543.726335 | The Clementi Mall | 1376.926421 | 11759.623548 | 17 |
| 90 | Apartment | 3 | RCR | New Sale | 1180000 | 431 | Strata | 31 to 35 | 2741 | 99.0 | 2019.0 | 2020 | 98.0 | Outram Park MRT Station | 198.124702 | People's Park Complex | 372.345357 | 1306.394630 | 11 |
| 482 | Apartment | 7 | CCR | New Sale | 1002000 | 409 | Strata | 01 to 05 | 2450 | 99.0 | 2019.0 | 2020 | 98.0 | Bugis MRT Station | 356.917008 | Bugis Cube | 35.853066 | 1925.187790 | 10 |
| 483 | Apartment | 7 | CCR | New Sale | 1022400 | 409 | Strata | 01 to 05 | 2500 | 99.0 | 2019.0 | 2020 | 98.0 | Bugis MRT Station | 356.917008 | Bugis Cube | 35.853066 | 1925.187790 | 10 |
| 560 | Apartment | 10 | CCR | New Sale | 1118000 | 484 | Strata | 06 to 10 | 2308 | 99.0 | 2018.0 | 2020 | 97.0 | Sixth Avenue MRT Station | 170.734619 | Grandstand | 918.931949 | 8076.427804 | 10 |
| 2395 | Condominium | 13 | RCR | New Sale | 793000 | 463 | Strata | 01 to 05 | 1713 | 99.0 | 2017.0 | 2018 | 98.0 | Woodleigh MRT Station | 109.254323 | The Poiz | 846.094002 | 6676.117205 | 10 |
| 2876 | Condominium | 18 | OCR | New Sale | 653000 | 463 | Strata | 01 to 05 | 1411 | 99.0 | 2018.0 | 2019 | 98.0 | Simei MRT Station | 645.759148 | Eastpoint Mall | 567.074620 | 12914.580599 | 10 |